skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Wing, Rod A"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract BackgroundUniversal single-copy orthologs are the most conserved components of genomes. Although they are routinely used for studying evolutionary histories and assessing new assemblies, current methods do not incorporate information from available genomic data. ResultsHere, we first determine the influence of evolutionary history on universal gene content and find that across 11,098 genomes of plants, fungi, and animals comprising 2606 taxonomic groups, 215 groups significantly vary from their respective lineages in terms of BUSCO (Benchmarking Universal Single Copy Orthologs) completeness. Additionally, 169 groups display an elevated complement of duplicated orthologs, likely from ancestral whole genome duplication events. Secondly, we investigate the extent of taxonomic congruence in broad BUSCO-derived phylogenies. For 275 suitable families out of 543 tested, sites evolving at higher rates produce at most 23.84% more taxonomically concordant, and at least 46.15% less terminally variable phylogenies compared to lower-rate sites. We find that BUSCO concatenated and coalescent trees have comparable accuracy and conclude that higher rate sites from concatenated alignments produce the most congruent and least variable phylogenies. Finally, we show that undetected, yet pervasive BUSCO gene loss events lead to misrepresentations of assembly quality. To overcome this, we filter a Curated set of BUSCOs (CUSCOs) that provide up to 6.99% fewer false positives compared to the standard search and introduce novel methods for comparing assemblies using gene synteny. ConclusionsOverall, we highlight the importance of considering evolutionary histories during assembly evaluations and release the phyca software toolkit that reconstructs consistent phylogenies and offers more precise assembly assessments. 
    more » « less
    Free, publicly-accessible full text available December 1, 2026
  2. Abstract Background Events of gene fusion have been reported in several organisms. However, the general role of gene fusion as part of new gene origination remains unknown. Results We conduct genome-wide interrogations of four Oryza genomes by designing and implementing novel pipelines to detect fusion genes. Based on the phylogeny of ten plant species, we detect 310 fusion genes across four Oryza species. The estimated rate of origination of fusion genes in the Oryza genus is as high as 63 fusion genes per species per million years, which is fixed at 16 fusion genes per species per million years and much higher than that in flies. By RNA sequencing analysis, we find more than 44% of the fusion genes are expressed and 90% of gene pairs show strong signals of purifying selection. Further analysis of CRISPR/Cas9 knockout lines indicates that newly formed fusion genes regulate phenotype traits including seed germination, shoot length and root length, suggesting the functional significance of these genes. Conclusions We detect new fusion genes that may drive phenotype evolution in Oryza. This study provides novel insights into the genome evolution of Oryza. 
    more » « less
  3. Summary In grasses, two types of phased, small interfering RNAs (phasiRNAs) are expressed largely in young, developing anthers. They are 21 or 24 nucleotides (nt) in length and are triggered by miR2118 or miR2275, respectively. However, most of their functions and activities are not fully understood.We performed comparative genomic analysis of their source loci (PHAS) in fiveOryzagenomes and combined this with analysis of high‐throughput sRNA and degradome datasets. In total, we identified 8216 21‐PHASand 626 24‐PHASloci. Local tandem and segmental duplications mainly contributed to the expansion and supercluster distribution of the 21‐PHASloci. Despite their relatively conserved genomic positions,PHASsequences diverged rapidly, except for the miR2118/2275 target sites, which were under strong selection for conservation.We found that 21‐nt phasiRNAs with a 5′‐terminal uridine (U) demonstratedcis‐cleavage atPHASprecursors, and thesecis‐acting sites were also variable among close species. miR2118 could trigger phasiRNA production from its own antisense transcript and the derived phasiRNAs might reversibly regulate miR2118 precursors.We hypothesised that successful initiation of phasiRNA biogenesis is conservatively maintained, while phasiRNA products diverged quickly and are not individually conserved. In particular, phasiRNA production is under the control of multiple reciprocal regulation mechanisms. 
    more » « less